On Equivalence of Martingale Tail Bounds and Deterministic Regret Inequalities
نویسندگان
چکیده
We study an equivalence of (i) deterministic pathwise statements appearing in the online learning literature (termed regret bounds), (ii) high-probability tail bounds for the supremum of a collection of martingales (of a specific form arising from uniform laws of large numbers for martingales), and (iii) in-expectation bounds for the supremum. By virtue of the equivalence, we prove exponential tail bounds for norms of Banach space valued martingales via deterministic regret bounds for the online mirror descent algorithm with an adaptive step size. We extend these results beyond the linear structure of the Banach space: we define a notion of martingale type for general classes of real-valued functions and show its equivalence (up to a logarithmic factor) to various sequential complexities of the class (in particular, the sequential Rademacher complexity and its offset version). For classes with the general martingale type 2, we exhibit a finer notion of variation that allows partial adaptation to the function indexing the martingale. Our proof technique rests on sequential symmetrization and on certifying the existence of regret minimization strategies for certain online prediction problems.
منابع مشابه
Concentration Inequalities for Dependent Random Variables via the Martingale Method
Abstract: We use the martingale method to establish concentration inequalities for a class of dependent random sequences on a countable state space, with the constants in the inequalities expressed in terms of certain mixing coefficients. Along the way, we obtain bounds on certain martingale differences associated with the random sequences, which may be of independent interest. As an applicatio...
متن کاملA Trajectorial Interpretation of Doob’s Martingale Inequalities
We present a unified approach to Doob’s Lp maximal inequalities for 1 ≤ p < ∞. The novelty of our method is that these martingale inequalities are obtained as consequences of elementary deterministic counterparts. The latter have a natural interpretation in terms of robust hedging. Moreover our deterministic inequalities lead to new versions of Doob’s maximal inequalities. These are best possib...
متن کاملOnline Regret Bounds for Markov Decision Processes with Deterministic Transitions
We consider an upper confidence bound algorithm for Markov decision processes (MDPs) with deterministic transitions. For this algorithm we derive upper bounds on the online regret (with respect to an (ε-)optimal policy) that are logarithmic in the number of steps taken. These bounds also match known asymptotic bounds for the general MDP setting. We also present corresponding lower bounds. As an...
متن کاملOn joint distributions of the maximum, minimum and terminal value of a continuous uniformly integrable martingale
We study the joint laws of a continuous, uniformly integrable martingale, its maximum, and its minimum. In particular, we give explicit martingale inequalities which provide upper and lower bounds on the joint exit probabilities of a martingale, given its terminal law. Moreover, by constructing explicit and novel solutions to the Skorokhod embedding problem, we show that these bounds are tight....
متن کاملSome Probability Inequalities for Quadratic Forms of Negatively Dependent Subgaussian Random Variables
In this paper, we obtain the upper exponential bounds for the tail probabilities of the quadratic forms for negatively dependent subgaussian random variables. In particular the law of iterated logarithm for quadratic forms of independent subgaussian random variables is generalized to the case of negatively dependent subgaussian random variables.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017